Action Understanding as Inverse Planning Appendix
نویسندگان
چکیده
This section formalizes the encoding of an agent’s environment and goal into a Markov decision problem (MDP), and describes how this MDP can be solved efficiently by algorithms for rational planning. Let π be an agent’s plan, referred to here (and in the MDP literature) as a policy, such that Pπ(at|st, g, w) is a probability distribution over actions at at time t, given the agent’s state st at time t, the agent’s goal g and world state w. This distribution formalizes P (Actions|Goal, Environment), the expression for probabilistic planning sketched in the main text. The policy π encodes all goal-dependent plans the agent could make in a given environment. We assume that agents’ policies follow the principle of rationality. Within a goal-based MDP, this means that agents choose action sequences that minimize the expected cost to achieve their goals, given their beliefs about the environment. Let Cg,w(a, s) be the environmentand goal-dependent cost to an agent of taking action a in state s. The expected cost to an agent of executing policy π starting from state s is given by the agent’s value function, which sums the costs the agent is expected to incur over an infinite horizon:
منابع مشابه
Urban expansion as a driver of biodiversity loss: Integrating biodiversity in urban planning in African context
Africa has high biodiversity and is rapidly urbanizing. However, there is limited understanding of how urban expansion in Africa is likely to affect its habitats and biodiversity. Little urban ecological research has been done in Africa. This study needs to think ahead as Africa move into the “urban age” it is critical to inform the public on the importance of urban environment...
متن کاملAction understanding as inverse planning.
Humans are adept at inferring the mental states underlying other agents' actions, such as goals, beliefs, desires, emotions and other thoughts. We propose a computational framework based on Bayesian inverse planning for modeling human action understanding. The framework represents an intuitive theory of intentional agents' behavior based on the principle of rationality: the expectation that age...
متن کاملکنش جمعی؛ نقطه اتصال تفکر نهادگرا و نگرش برنامهریزی ارتباطی
Institution is the collective action in control, liberation, and expansion of individual action; so that individuals are identified and realized through their participation in institutions. In the institutionalist perspective, the question of planning goes beyond the dichotomy of public-private sectors, and is transmuted from being an individual or interpersonal activity to an aspect of govern...
متن کاملIntersectoral Planning for Public Health: Dilemmas and Challenges
Background Intersectoral action is often presented as essential in the promotion of population health and health equity. In Norway, national public health policies are based on the Health in All Policies (HiAP) approach that promotes whole-of-government responsibility. As part of the promotion of this intersectoral responsibility, p...
متن کاملDoes the Appendix Location Change During Pregnancy?
Background: Early diagnosis of appendicitis is important during pregnancy. Unlike omentom, cecum and appendix are not likely to be displaced by the growing uterus, so the concept of position change of the appendix during pregnancy is questionable. Objective: To determine the anatomical location of appendix during pregnancy. Methods: In a prospective study from October 1995 to March 1999, 291 w...
متن کامل